Virtual Image Artifact Detection

Related Applications 

[0001] This application is related to U.S. Patent No. 10/763,453, entitled 
"Virtual Camera Translation" and filed on January 23, 2004, which is incorporated 
herein by reference for all that it discloses and teaches. 

[0002] This application is also related to U.S. Patent Application No.
[MSDocket No. 308954.1], entitled "Virtual Image Generation" and
filed on April 16, 2004.

Technical Field 

[0003] The invention relates generally to digital image processing, and more 
particularly to virtual image artifact detection. 

Background 

[0004] Digital video cameras are useful in both consumer and professional 
contexts. Generally, digital video cameras capture sequences of digital images, which 
may then be transferred to a computer system for display or processing or to a storage 
device for storage. Alternatively, digital still cameras may be employed to capture 
distinct, non-video digital images. 

[0005] In some applications, stereo cameras may be employed to generate 
synthetic virtual images from a variety of viewpoints. For example, in a video
conferencing application, a single video camera can capture images of a conference
participant. However, the participant's gaze tends to align with a display window 
presented in a monitor by the video conferencing system, while the video camera is 
typically mounted to the side of or above or below the display window, resulting in a
misalignment between the participant's gaze and the captured video (e.g., capturing 
images of the side of the participant's head rather than a "straight-on" view). 
Accordingly, virtual images of straight-on, cyclopean views may be synthesized from 
stereo images captured by stereo cameras that are offset relative to the display 
window. It should be understood that other applications of virtual image generation 
may be employed outside the field of video conferencing. 

[0006] However, existing methods of synthesizing cyclopean virtual images 
from stereo images often produce artifacts (e.g., streaks, blocks, and halos) that 
diminish the quality of the resulting virtual images. For example, an exemplary type 
of artifact results from mismatched mixes of foreground and background colors in 
corresponding pixels of stereo images. Because of the pixel disparities introduced by 
the relative positions and angles of the stereo cameras, it is common that the 
background colors contributing to the color mixing in one pixel of one stereo image 
are different from the background colors contributing to the color mixing in a 
corresponding pixel of another stereo image. A result of this color mixing difference 
is a tendency to inaccurately synthesize a virtual image pixel where color mixing
discrepancies exist, thereby introducing incorrect transparency effects, streaking, and
aliasing into the resulting virtual image.

Summary 

[0007] Implementations described and claimed herein address the foregoing 
problems by detecting the artifacts in a virtual image generated from stereo images. A 
disparity map is generated from the stereo images, and individual projected images are 
determined based on the disparity map and the corresponding stereo images. A
difference map is then computed between the individual projected images to indicate 
the artifacts. 

[0008] Having detected an artifact in the virtual image, a source patch in the 
virtual image is defined relative to the artifact. A target patch is generated using a 
split-patch search technique as a composite of a background exemplar patch and a 
foreground exemplar patch. Each exemplar patch may be identified from an image 
patch selected from at least one of the input stereo images. The source patch of the 
virtual image is replaced with the target patch to correct the detected artifact. 

[0009] In some implementations, articles of manufacture are provided as 
computer program products. One implementation of a computer program product 
provides a computer program storage medium readable by a computer system and 
encoding a computer program. Another implementation of a computer program 
product may be provided in a computer data signal embodied in a carrier wave by a 
computing system and encoding the computer program. 

[0010] The computer program product encodes a computer program for a 
computer process executing on a computer system. The computer process detects one 
or more artifacts in a virtual image synthesized from stereo images. A projected 
image is generated for each of the stereo images by projecting each stereo image into 
a target viewpoint based on a disparity map of the stereo images. Differences between 
corresponding pixels in the projected images are computed to produce a difference 
map. A computed difference exceeding a threshold indicates an artifact. 

[0011] In another implementation, a method is provided that detects one or
more artifacts in a virtual image synthesized from stereo images. A disparity map is 
generated from the stereo images. A projected image is generated for each of the 
stereo images by projecting each stereo image into a target viewpoint based on the 
disparity map. Color-distances between corresponding pixels in the projected images 
are computed to produce a difference map. One or more locations in the difference 
map associated with a computed color-distance exceeding a threshold are designated 
as the one or more artifacts. 

[0012] In another implementation, a system for detecting one or more artifacts 
in a virtual image synthesized from stereo images is provided. An image warp 
module generates a projected image for each of the stereo images by projecting each 
stereo image into a target viewpoint based on a disparity map of the stereo images. 
An image distancing module computes color-distances between corresponding pixels 
in the projected images to produce a difference map. A thresholding module 
designates one or more locations in the difference map associated with a computed 
color-distance exceeding a threshold as the one or more artifacts. 

[0013] Other implementations are also described and recited herein. 

Brief Descriptions of the Drawings 

[0014] FIG. 1 illustrates an exemplary artifact-correcting virtual image 
generation system. 

[0015] FIG. 2 illustrates an exemplary video conferencing system configuration 
for generating an artifact-corrected virtual image. 

[0016] FIG. 3 illustrates a cause of color mixing discrepancies in corresponding
pixels of stereo images. 

[0017] FIG. 4 illustrates a color mixing discrepancy in corresponding pixels of 
stereo images. 

[0018] FIG. 5 depicts an exemplary system for removing artifacts from a 
virtual image. 

[0019] FIG. 6 depicts operations in an exemplary artifact detection process. 

[0020] FIG. 7 illustrates splitting a virtual image patch into foreground and 
background components based on a disparity map in an exemplary implementation. 

[0021] FIG. 8 illustrates an exemplary split-patch search of candidate exemplar 
patches. 

[0022] FIG. 9 illustrates results of an exemplary selection of candidate 
exemplar patches. 

[0023] FIG. 10 illustrates exemplary occlusion of a background region of a
patch.

[0024] FIG. 11 illustrates a system useful for implementing an embodiment of
the present invention. 

Detailed Description 

[0025] FIG. 1 illustrates an exemplary artifact-correcting virtual image 
generation system 100. In the system 100, a left image 102 is captured by a camera 
mounted on the right side of the video display, as seen by the user. Likewise, a right 
image 104 is captured by a camera mounted on the left side of the video display, as 
seen by the user. As such, in both images, the user can be seen looking into the video 
display, as opposed to looking directly at one of the cameras. The left and right
images 102 and 104 are input to a virtual image synthesis module 106, which 
generates from the images 102 and 104 a virtual image 108 with gaze correction. The 
virtual image synthesis module 106 may also generate an occlusion map 110 and a 
stereo disparity map 112, as shown in FIG. 1. Exemplary methods of synthesizing 
virtual images from stereo images are described in previously-incorporated 
application U.S. Patent No. 10/763,453, entitled "Virtual Camera Translation". 

[0026] The virtual image 108, the occlusion map 110, and the stereo disparity 
map 112 generated by the virtual image synthesis module 106 are input to an artifact 
correction module 114 to generate the virtual image 116 with gaze correction and 
artifact correction. As a result, the virtual image 116 depicts a high-quality image of 
the user appearing to look directly into the camera. 

[0027] FIG. 2 illustrates an exemplary video conferencing system 200 
configuration for generating an artifact-corrected virtual image. A computer 
system 202 is coupled to a video display 204 having two cameras 206 and 208 
mounted on either side of the video display 204. It should be understood that other
stereo placements of the cameras 206 and 208 may be employed (e.g., top/bottom, a
four-camera configuration at each corner, etc.). A video window 210 displays a remote participant
on the other end of the video conference session. 

[0028] In a configuration having only a single camera, the user typically 
focuses his or her eyes on the video window 210, while the single camera captures 
images of the user from one side or the other. As such, the captured images sent to the 
remote participant are primarily a side view of the user's head, not the desired 
straight-on view of the user's face. The illustrated configuration, however, allows
synthesis of a cyclopean virtual image from the captured left and right images of the 
user. It should be understood that cyclopean refers to the single virtual image. 
Furthermore, in one implementation, the cyclopean virtual image may be displayed at 
different video window locations on the display screen (i.e., cyclopean virtual image 
location is not limited to a central orientation relative to the stereo cameras) while 
maintaining alignment of the virtual camera with the user's gaze. Likewise, axial 
translation of the virtual image may also be achieved in an implementation. 

[0029] It should be understood that more than two cameras may also be used to 
generate a cyclopean virtual image. Likewise, the cameras may be in alternative 
orientations, such as at the top and bottom of the video display. For example, one 
configuration may include four cameras, each placed at a corner of the video display. 

[0030] FIG. 3 illustrates a cause of color mixing discrepancies in corresponding 
pixels of stereo images. A foreground object 300 is shown against a multi-color 
background 302, where the solid line 304 represents one color A and the broken 
line 306 represents another color B, and both background colors differ from the color 
of the foreground object. 

[0031] A right camera 308 captures a right image 310 that includes a pixel 312
having the color of the foreground object, a pixel 314 having the color of the background
B, and a pixel 316 having a color mix of the foreground object and the background B.
A left camera 318 captures a left image 320 that includes a pixel 322 having the color of
the foreground object, a pixel 324 having the color of the background A, and a
pixel 326 having a color mix of the foreground object and the background A. As
discussed, the different color mixes of the corresponding pixels 316 and 326 may
produce artifacts caused by mismatching the pixels during the virtual image synthesis 
process (e.g., generation of the disparity map is inaccurate because pixel 316 is not 
matched with pixel 326 because of the color mixing difference). 
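
As a rough numeric illustration of this discrepancy, a boundary pixel can be modeled as a linear blend of the foreground color and whichever background color lies behind it. The RGB values and the 50% coverage below are made-up assumptions chosen for the example, not values taken from the figures, and `mixed_pixel` is a hypothetical helper name.

```python
import numpy as np

def mixed_pixel(foreground, background, coverage):
    """Blend a foreground and a background RGB color by fractional coverage."""
    foreground = np.asarray(foreground, dtype=float)
    background = np.asarray(background, dtype=float)
    return coverage * foreground + (1.0 - coverage) * background

# Hypothetical colors: one foreground object, two different backgrounds.
foreground = [200.0, 160.0, 120.0]
background_a = [60.0, 40.0, 20.0]     # seen behind the object by the left camera
background_b = [220.0, 200.0, 170.0]  # seen behind the object by the right camera

left_pixel = mixed_pixel(foreground, background_a, 0.5)   # plays the role of pixel 326
right_pixel = mixed_pixel(foreground, background_b, 0.5)  # plays the role of pixel 316

# Euclidean RGB distance between the two "corresponding" pixels.
color_gap = np.linalg.norm(left_pixel - right_pixel)
```

Even though both pixels image the same point on the foreground boundary, their colors differ substantially, which is what can mislead a color-based stereo matcher.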

[0032] FIG. 4 illustrates a color mixing discrepancy in corresponding pixels 
400 and 402 of stereo images 404 (left) and 406 (right). An image 408 represents a 
magnified version of region 410 of left image 404, and an image 412 represents a 
magnified version of region 414 of right image 406. As is evident from the magnified 
images 408 and 412, the corresponding pixels 400 and 402 include colors of the 
subject's shoulder and colors of the background. However, a dark brown door 
provides the background color contribution in the pixel 400 while a light tan wall 
provides the background color contribution in the pixel 402. As a result, the colors of 
the corresponding pixels 400 and 402 are different. 

[0033] When generating a virtual image pixel corresponding to the pixels 400 
and 402, the color mixing discrepancy between the two pixels can result in a 
mismatch as a disparity graph is generated along the epipolar lines associated with the 
two pixels. Color mixing mismatches typically result in artifacts in the virtual image 
(e.g., artifacts 500 in FIG. 5). However, such artifacts can be detected and corrected 
(e.g., reduced) using techniques described herein. 

[0034] FIG. 5 depicts an exemplary system for removing artifacts 500 from a 
virtual image 502. The artifacts 500 result from color mixing mismatches, such as the 
mismatch illustrated in FIG. 4. An artifact correction module 504 corrects the 
artifacts to provide a more suitable result image 506, in which the corresponding 
corrected region 508 shows dramatically reduced artifact effects. Other artifact
regions in the images also show improved results (see regions 510 and 512). 

[0035] FIG. 6 depicts operations in an exemplary artifact detection process. In 
the illustrated implementation, a left image 600 (denoted as $I_l$) and a right image 602
(denoted as $I_r$) are processed (e.g., "epipolar-rectified") by a disparity map generator
604 to generate from the two images a stereo disparity map 606 (denoted as $D$) and an
occlusion map 607 (denoted as $O$). A couple of techniques for generating cyclopean
virtual images from stereo images are described in previously incorporated U.S.
Patent No. 10/763,453, entitled "Virtual Camera Translation", although other
techniques may be used. The disparity map $D$ is generated with respect to the
coordinate system defined by the desired virtual viewpoint (e.g., the desired location
of the virtual camera, such as the display window of a video conferencing system). A
rough virtual image $I$ (not shown) is also generated from images $I_l$ and $I_r$ by techniques
described in the previously incorporated U.S. Patent No. 10/763,453, entitled "Virtual
Camera Translation" or any other known or equivalent method. The rough virtual
image $I$ includes artifacts, such as those artifacts introduced by color mixing mismatches.

[0036] The left image 600, the disparity map 606, and the occlusion map 607 
are input to a disparity-driven image warp module 608 to generate a projected left 
image 610 with occlusion regions (i.e., the solid light grey and solid dark grey regions 
to the left and right of the subject). Likewise, the right image 602, the disparity 
map 606, and the occlusion map 607 are input to a disparity-driven image warp 
module 612 to generate a projected right image 614 with occlusion regions. It should 
be understood that modules 608 and 612 are illustrated as distinct modules but could 
be implemented as the same module taking different inputs (e.g., left and right
images). The projected images 610 and 614 (denoted as $I_l^w$ for the projected left
image and $I_r^w$ for the projected right image) represent projections of the
corresponding (e.g., left or right) images into a target viewpoint. The light grey and
dark grey regions represent left and right half-occlusions.
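
A minimal sketch of a disparity-driven warp is given here, under stated assumptions: the images are rectified so that disparity is purely horizontal, the virtual camera sits midway between the two cameras (so each input view is half a disparity away), and nearest-neighbor sampling is good enough. The function name, the `direction` sign convention, and the returned validity mask are illustrative choices, not details taken from the warp modules 608 and 612.

```python
import numpy as np

def warp_to_virtual_view(image, disparity, occlusion, direction):
    """Backward-warp one stereo image into the virtual (cyclopean) viewpoint.

    image:     (H, W, 3) array, one of the rectified stereo images.
    disparity: (H, W) array expressed in the virtual view's coordinates.
    occlusion: (H, W) boolean array, True where this view is half-occluded.
    direction: +1 or -1, the side of the virtual camera this view lies on
               (an assumed convention).
    Returns the projected image and a mask of pixels that received a color.
    """
    height, width = disparity.shape
    rows, cols = np.mgrid[0:height, 0:width]
    # Assumption: each input view is half a disparity away from the virtual view.
    source_cols = np.rint(cols + direction * disparity / 2.0).astype(int)
    occluded = np.asarray(occlusion, dtype=bool)
    valid = (source_cols >= 0) & (source_cols < width) & ~occluded
    projected = np.zeros_like(image)
    # Copy colors only where the source column is inside the image and visible.
    projected[valid] = image[rows[valid], source_cols[valid]]
    return projected, valid
```

Calling the function once with `direction=+1` and once with `direction=-1` would yield the two projected images; pixels left unfilled correspond to the half-occlusion regions described above.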

[0037] A pixel-wise color distance $d(I_l^w, I_r^w)$ between the two projected images
is computed by an aliasing-insensitive image distancing module 616 to indicate the
location and entity of artifacts, as shown in the difference map 618. In one
implementation, artifacts are represented as:

the set $A$ of pixels $p \in I$ such that $d(I_l^w, I_r^w) > \lambda$ (e.g., $\lambda = 5$).

[0038] Therefore, the image noise of the difference map 618 may be filtered
(e.g., $d(I_l^w, I_r^w) > \lambda$), for example, by a thresholding module 620, to generate an artifact
map 622. Assuming low levels of image noise, large values of $d(I_l^w, I_r^w)$ in the
artifact map 622 occur in pixel locations where the virtual image synthesis algorithm
has failed to correctly estimate the correct pixel correspondence between the two
images $I_l$ and $I_r$ (e.g., because of color mixing mismatches or other matching errors).
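
The distance and thresholding steps can be sketched as follows. This is a simplified illustration only: it assumes the color distance is the Euclidean distance in RGB and makes no attempt at the aliasing insensitivity attributed to module 616. The function name and the optional `valid` mask are hypothetical.

```python
import numpy as np

def detect_artifacts(projected_left, projected_right, valid=None, threshold=5.0):
    """Return (difference_map, artifact_map) for two projected images.

    Assumption: the color distance is the Euclidean distance in RGB; the
    description only calls it a pixel-wise color distance.
    """
    left = np.asarray(projected_left, dtype=float)
    right = np.asarray(projected_right, dtype=float)
    # Per-pixel distance between corresponding colors.
    difference_map = np.linalg.norm(left - right, axis=-1)
    # Keep only differences above the threshold, treating the rest as noise.
    artifact_map = difference_map > threshold
    if valid is not None:
        # Ignore half-occluded pixels, where only one view contributes.
        artifact_map &= np.asarray(valid, dtype=bool)
    return difference_map, artifact_map
```

The default `threshold=5.0` mirrors the exemplary threshold value given above; a suitable value in practice would depend on the color scale and noise level of the cameras.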

[0039] FIG. 7 illustrates splitting a virtual image patch into foreground and 
background filter components based on a disparity map in an exemplary 
implementation. As discussed, the artifact map 622 of FIG. 6 indicates pixel locations 
of artifacts in the rough virtual image generated by a virtual image synthesis module. 
By virtue of the virtual image synthesis, a rough virtual image $I$, a disparity map $D$,
and a set $A$ of artifacts are available as inputs to an artifact correction operation.

[0040] For each pixel $p \in A$, a source patch $\Phi_p$ centered at $p$ is defined. For
example, a source patch may include a set of neighboring pixels, such as a square
region that is 5 pixels wide and 5 pixels high and centered at $p$. In the illustration,
larger patches are shown to assist in the description. An artifact correction operation
searches for a new target patch $\Psi_p$ with which to replace the source patch $\Phi_p$. The
new target patch $\Psi_p$ will be similar to the source patch $\Phi_p$ but with the artifacts
removed. Replacing a source patch $\Phi_p$ with a new target patch $\Psi_p$ for all pixels $p$
removes detected artifacts throughout the entire image.
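
The outer correction loop might look like the sketch below. The helper `make_target_patch` is a hypothetical callable standing in for the search and compositing steps described later; skipping artifact pixels too close to the image border is a simplification made here, not something the description specifies.

```python
import numpy as np

def patch_bounds(center, size, shape):
    """Slices of a size x size patch centered at `center`, or None if it leaves the image."""
    half = size // 2
    row, col = center
    if row < half or col < half or row + half >= shape[0] or col + half >= shape[1]:
        return None
    return slice(row - half, row + half + 1), slice(col - half, col + half + 1)

def correct_artifacts(rough_image, artifact_map, make_target_patch, size=5):
    """Replace the source patch around every artifact pixel with a target patch.

    make_target_patch(source_patch, center) is a hypothetical callable that
    returns a replacement patch of the same shape as source_patch.
    """
    corrected = rough_image.copy()
    for row, col in np.argwhere(artifact_map):
        bounds = patch_bounds((row, col), size, artifact_map.shape)
        if bounds is None:
            continue  # border pixels are skipped in this sketch
        corrected[bounds] = make_target_patch(rough_image[bounds], (row, col))
    return corrected
```

Source patches are always read from the uncorrected image, so the result does not depend on the order in which artifact pixels are visited, although overlapping target patches overwrite one another.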

[0041] In FIG. 7, an artifact point 700 (denoted as $p$) in a rough virtual
image 702 (denoted as $I$) is selected. A source patch 704 (denoted as $\Phi_p$) relative to
the artifact 700 is defined in the rough virtual image 702. A disparity point 706
(denoted again as $p$, because its location corresponds to the location of point 700) in a
disparity map 708 (denoted as $D$) is identified as corresponding to the pixel $p$ in the
rough virtual image 702. A corresponding disparity patch 710 (denoted as $D_p$) is also
identified. The disparity patch 710 is smoothed by a filter module 712 to provide a
filtered disparity patch $\tilde{D}_p$ (not shown). By filtering the patch (e.g., by performing a
low-pass smoothing operation on the patch), high frequency components of the
disparity signal, which often arise from matching mistakes, are removed. Given these
inputs, a foreground weight array $\Omega_p^f$ and a background weight array $\Omega_p^b$ may be
computed by a weighting module 714 as follows:

$$\Omega_p^f(q) = \frac{\tilde{D}(q) - D_m}{D_M - D_m}, \qquad \Omega_p^b(q) = 1 - \Omega_p^f(q), \qquad \forall q \in \Phi_p$$

with $D_m$ and $D_M$ representing the minimum and maximum values respectively of the
disparities within the filtered disparity patch $\tilde{D}_p$. A map 716 representing a
foreground weighting array, and a map 718 representing a background weighting
array are shown as outputs from the weighting module 714. The weighting arrays
may also be referred to as "filter maps".
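
A small sketch of the weighting step follows, assuming the foreground weight is the filtered disparity normalized to the range [0, 1] within the patch and the background weight is its complement. The 3x3 box filter is only a stand-in for the unspecified low-pass filter, and the handling of a flat disparity patch is a choice made for this example.

```python
import numpy as np

def split_patch_weights(disparity_patch, smooth=True):
    """Compute foreground and background weight arrays from a disparity patch."""
    patch = np.asarray(disparity_patch, dtype=float)
    if smooth:
        # Assumption: a 3x3 box filter stands in for the unspecified low-pass filter.
        padded = np.pad(patch, 1, mode="edge")
        rows, cols = patch.shape
        patch = sum(
            padded[r:r + rows, c:c + cols] for r in range(3) for c in range(3)
        ) / 9.0
    d_min, d_max = patch.min(), patch.max()
    if d_max == d_min:
        # Flat disparity: no foreground/background split (a choice made here).
        foreground = np.zeros_like(patch)
    else:
        # Larger disparity means closer to the cameras, hence more "foreground".
        foreground = (patch - d_min) / (d_max - d_min)
    return foreground, 1.0 - foreground
```

The two arrays sum to one at every pixel, so weighting a patch by each of them splits it into complementary foreground and background parts.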

[0042] FIG. 8 illustrates an exemplary split-patch search of candidate exemplar 
patches. The split-patch search searches for patches in the original right and left 
images that are most similar to the foreground and background portions of the source 
patch from the virtual image. In one implementation, the search is performed along 
the scan line (e.g., an epipolar line) corresponding to point $p$ up to a distance $\delta_y$ in
each of the original images. An exemplary value of $\delta_y$ may be computed as follows:

$$\delta_y = \max_{q} \frac{D(q)}{2}$$

[0043] An artifact and a corresponding source patch 800 (shown in exploded
view 802 and denoted as $\Phi_p$) are selected from a rough virtual image 804. In the
illustrated implementation, the artifact is detected by operations described with regard
to FIG. 6, although other artifact detection methods may be employed. A foreground
filter 806 (denoted as $\Omega_p^f$) is used in combination with the source patch 802 and
generic left/right-view patches to compute candidate foreground exemplar patches
$R_p^f$ and $L_p^f$. Likewise, background filter 806 (denoted as $\Omega_p^b$) is used in combination with
the source patch 802 and generic left/right-view patches to compute candidate
background exemplar patches $R_p^b$ and $L_p^b$.

[0044] Choosing the right image 810 to exemplify split-patch search 
operations, generic patches along the epipolar line that includes the pixel p in the right 
image are evaluated against the source patch, as modified by the foreground and 
background filters (see the equations below), to identify the candidate exemplar 
patches. Selecting the minimum difference between the pixel-wise parameters below
(e.g., $\Omega_p^f * \Phi_p$ and $\Omega_p^f * R_q$) represents one exemplary implementation of such a
split-patch search. The graphs 812 and 814 illustrate the evaluations used to select the
candidate exemplar patches 816 (foreground) and 818 (background) from the right
image 810. The process is repeated for the left image (not shown).

[0045] Algorithmically, given appropriate search constraints, candidate 
exemplar patches may be determined as follows: 

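$$R_p^f = \arg\min_{p_x - \delta_y \le q_x \le p_x} d(\Omega_p^f * \Phi_p,\ \Omega_p^f * R_q)$$

$$R_p^b = \arg\min_{p_x - \delta_y \le q_x \le p_x} d(\Omega_p^b * \Phi_p,\ \Omega_p^b * R_q)$$

$$L_p^f = \arg\min_{p_x \le q_x \le p_x + \delta_y} d(\Omega_p^f * \Phi_p,\ \Omega_p^f * L_q)$$

$$L_p^b = \arg\min_{p_x \le q_x \le p_x + \delta_y} d(\Omega_p^b * \Phi_p,\ \Omega_p^b * L_q)$$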

with $L_q$ and $R_q$ representing the generic left and right view patches centered at the
generic point $q$ (with $q_y = p_y$) along the epipolar line. The symbol "*" represents point-wise
multiplication between images (or patches). In one implementation, the distance
$d(\Pi_1, \Pi_2)$ between two generic patches $\Pi_1$ and $\Pi_2$ is represented as the sum of squared
differences (SSD) of pixel values, where the pixels of $A$ are ignored.
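
One way to sketch the search for a single candidate exemplar patch is shown below. It assumes rectified images (so the search runs along one image row), square odd-sized patches, and a weighted SSD in which pixels flagged as artifacts are skipped. The function names, the inclusive column limits `q_min` and `q_max`, and the `ignore` mask are illustrative assumptions.

```python
import numpy as np

def masked_ssd(patch_a, patch_b, weights, ignore=None):
    """SSD between two patches after point-wise weighting; ignored pixels are skipped."""
    # weights * a - weights * b equals weights * (a - b), so weight the difference once.
    diff = (weights[..., None] * (patch_a - patch_b)) ** 2
    if ignore is not None:
        diff = diff[~ignore]
    return float(diff.sum())

def split_patch_search(source_patch, weights, image, center, q_min, q_max, ignore=None):
    """Find the patch on the scan line of `image` that best matches the weighted source patch.

    center is (row, col) of pixel p; q_min..q_max are inclusive column limits.
    Returns (best_patch, best_col), or (None, None) if no candidate fits in the image.
    """
    half = source_patch.shape[0] // 2
    row, _ = center
    if row < half or row + half >= image.shape[0]:
        return None, None
    best_patch, best_col, best_cost = None, None, np.inf
    first = max(q_min, half)
    last = min(q_max, image.shape[1] - half - 1)
    for col in range(first, last + 1):
        candidate = image[row - half:row + half + 1, col - half:col + half + 1]
        cost = masked_ssd(source_patch, candidate, weights, ignore)
        if cost < best_cost:
            best_patch, best_col, best_cost = candidate, col, cost
    return best_patch, best_col
```

Under these assumptions, the four candidates would come from four calls: the foreground and background weight arrays each applied to the right image (columns from the pixel's column minus the search distance up to the pixel's column) and to the left image (columns from the pixel's column up to that column plus the search distance).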

[0046] FIG. 9 illustrates results of an exemplary selection of candidate 
exemplar patches. A rough virtual image 900, a right image 902, and a left image 904 
are processed in a manner described with regard to FIG. 8, resulting in a source 
patch 906, a right candidate foreground exemplar patch 908, a right candidate 
background exemplar patch 910, a left candidate foreground exemplar patch 912, and 
a left candidate background exemplar patch 914. These resulting candidate exemplar 
patches are considered "candidates" because occlusion can render one of the 
background candidate exemplar patches meaningless. Indeed, the true background 
patch of $\Phi_p$ has been occluded in the left view; thus the retrieved patch $L_p^b$ is
meaningless. In contrast, the right candidate background exemplar patch $R_p^b$ contains
the correct background information.

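[0047] Determining an uncontaminated background exemplar patch $\Pi_p^b$ may
be performed automatically by selecting the background patch that is most similar to
the background of the source patch 906, using:

$$\Pi_p^b = \arg\min_{\Lambda \in \{L_p^b,\, R_p^b\}} d(\Omega_p^b * \Phi_p,\ \Omega_p^b * \Lambda)$$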

[0048] The uncontaminated background exemplar patch $\Pi_p^b$ is one component
used to generate a new composite target patch $\Psi_p$. In addition, candidate exemplar
foreground patches $R_p^f$ and $L_p^f$ have been already determined for each pixel $p \in A$.
An effective algorithm for compositing a target patch $\Psi_p$ may be stated as:

$$\Psi_p = \Gamma_p * \Pi_p^f + (1 - \Gamma_p) * \Pi_p^b \qquad (1)$$

where $\Gamma_p$ represents the transparency of the pixel $p$, $\Pi_p^f$ represents the
uncontaminated foreground exemplar patch about the pixel $p$, and $\Pi_p^b$ represents the
uncontaminated background exemplar patch about the pixel $p$. However, $\Pi_p^f$ has not
yet been computed and, in fact, an exact solution for $\Pi_p^f$ does not appear available.
As such, reasonable approximations are employed in one implementation.
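
The compositing step of Equation (1) is an ordinary per-pixel blend, which can be sketched as below. The function name is hypothetical, and the sketch assumes one transparency value per pixel shared by all three color channels.

```python
import numpy as np

def composite_target_patch(transparency, foreground_patch, background_patch):
    """Blend foreground and background patches per pixel, as in Equation (1).

    transparency:     (H, W) array of values in [0, 1]; 1 keeps only the foreground.
    foreground_patch: (H, W, 3) uncontaminated foreground colors.
    background_patch: (H, W, 3) uncontaminated background colors.
    """
    # Add a trailing axis so the per-pixel value broadcasts over the RGB channels.
    gamma = np.asarray(transparency, dtype=float)[..., None]
    return gamma * foreground_patch + (1.0 - gamma) * background_patch
```

Note that, in the form of Equation (1), the transparency value multiplies the foreground term, so a value of 1 yields a pure foreground pixel and a value of 0 a pure background pixel.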

[0049] The candidate exemplar foreground patch $L_p^f$ may be interpreted itself
as a composite image. Its background (i.e., the poster on the back wall) is completely
visible in the right input view $I_r$. The background $\hat{L}_p^b$ of $L_p^f$ can be extracted by the
following search process:

$$\hat{L}_p^b = \arg\min d(\Omega_p^b * L_p^f,\ \Omega_p^b * R_q)$$

[0050] The analogous background $\hat{R}_p^b$ corresponding to the right foreground
patch $R_p^f$ is occluded by the subject's head and therefore cannot be copied directly
from either of the two input views. For example, in FIG. 10, the background of the
right patch $R_p^f$ 1000 centered at point $p$ (i.e., point 1002) is, of course, occluded in part
by the foreground of the right image $I_r$ 1004. Furthermore, the background of the
right patch $R_p^f$ 1000 cannot be found along the scan line 1006 of the left image 1008.
Accordingly, an approximation of $\hat{R}_p^b$ is computed by one of a variety of methods.

[0051] An exemplary method of approximating $\hat{R}_p^b$ may be implemented as
follows. Given the right foreground patch $R_p^f$ and the background filter $\Omega_p^b$, pixels of
$R_p^f$ that belong to the background (i.e., the door in the example image) are extracted
and a parametric surface model is fit (e.g., polynomial, spline, etc.) to the
corresponding color values in RGB space. Then, the fitted surface model is used to
extrapolate the colors of the pixels in the occluded portion of $\hat{R}_p^b$ (i.e., behind the hair
in the example). In one implementation applied to small patches (e.g., 5 pixels by 5
pixels), extrapolation via a generic planar fit (generally not constant) has produced
acceptable results. Symmetrical reasoning is applied when $\hat{L}_p^b$ is occluded.
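
The planar-fit variant can be sketched with an ordinary least-squares fit per color channel. The rule used here to decide which pixels count as visible background (background weight above a `cutoff` of 0.5) is an assumption made for the example, as are the function and parameter names.

```python
import numpy as np

def extrapolate_background(patch, background_weights, cutoff=0.5):
    """Fill the occluded part of a patch's background with a per-channel planar fit.

    Assumption: pixels whose background weight exceeds `cutoff` count as visible
    background; the description does not say how the filter becomes a selection.
    """
    patch = np.asarray(patch, dtype=float)
    rows, cols = np.mgrid[0:patch.shape[0], 0:patch.shape[1]]
    # One row per pixel: [row, col, 1], so that color = a*row + b*col + c.
    design = np.stack([rows.ravel(), cols.ravel(), np.ones(rows.size)], axis=1)
    visible = (np.asarray(background_weights) > cutoff).ravel()
    if visible.sum() < 3:
        raise ValueError("need at least three visible background pixels for a planar fit")
    colors = patch.reshape(-1, patch.shape[2])
    # Least-squares plane through the visible pixels, solved for all channels at once.
    coeffs, *_ = np.linalg.lstsq(design[visible], colors[visible], rcond=None)
    fitted = design @ coeffs
    # Keep observed background colors; use the fitted plane only where occluded.
    result = np.where(visible[:, None], colors, fitted)
    return result.reshape(patch.shape)
```

A plane has three free parameters per channel, which is why at least three visible, non-collinear background pixels are needed before the extrapolation is meaningful.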

[0052] Based on the approximation operation, two foreground patches ($L_p^f$ and
$R_p^f$) and two corresponding background patches ($\hat{L}_p^b$ and $\hat{R}_p^b$) have been extracted.
Therefore, the conventional compositing equation may be stated for each candidate
foreground exemplar patch:

$$L_p^f = \Gamma_p * \Pi_p^f + (1 - \Gamma_p) * \hat{L}_p^b \qquad (2)$$

$$R_p^f = \Gamma_p * \Pi_p^f + (1 - \Gamma_p) * \hat{R}_p^b \qquad (3)$$

with $\Gamma_p$ representing the transparencies and $\Pi_p^f$ representing the uncontaminated
foreground colors. Given that both background patches ($\hat{L}_p^b$ and $\hat{R}_p^b$) are known, then
both $\Gamma_p$ and $\Pi_p^f$ may now be uniquely determined from Equations (2) and (3).
Transparencies may be assumed to apply equally to each of the RGB channels.
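
One possible way to carry out this solve is sketched below; it is an illustration, not necessarily the procedure used in any particular implementation. Subtracting Equation (3) from Equation (2) eliminates the foreground term, leaving the difference of the two foreground patches equal to one minus the transparency times the difference of the two background patches. With one transparency shared across the three channels, a least-squares estimate per pixel follows, and the foreground color is then recovered by substituting back. The function name and the `eps` guard are assumptions.

```python
import numpy as np

def recover_transparency_and_foreground(left_fg, right_fg, left_bg, right_bg, eps=1e-6):
    """Solve Equations (2) and (3) per pixel for transparency and foreground color.

    All inputs are (H, W, 3) float arrays. Pixels where the two backgrounds are
    (nearly) identical are ill-posed and get transparency 1 (a choice made here).
    """
    bg_diff = left_bg - right_bg
    fg_diff = left_fg - right_fg
    denom = np.sum(bg_diff ** 2, axis=-1)
    well_posed = denom > eps
    one_minus_gamma = np.zeros(denom.shape)
    # Least-squares estimate over the three channels, which share one transparency.
    one_minus_gamma[well_posed] = (
        np.sum(fg_diff * bg_diff, axis=-1)[well_posed] / denom[well_posed]
    )
    gamma = np.clip(1.0 - one_minus_gamma, 0.0, 1.0)
    safe_gamma = np.maximum(gamma, eps)[..., None]
    # Average the foreground estimates implied by Equation (2) and Equation (3).
    from_left = (left_fg - (1.0 - gamma[..., None]) * left_bg) / safe_gamma
    from_right = (right_fg - (1.0 - gamma[..., None]) * right_bg) / safe_gamma
    return gamma, 0.5 * (from_left + from_right)
```

The solve breaks down where the two backgrounds have nearly the same color or where the transparency approaches zero, which is consistent with the remark below that noise and coincidentally similar colors can corrupt the recovery.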

[0053] Noise and coincidently similar colors shared by corresponding pixels 
can corrupt the accurate recovery of transparencies and foreground colors. However, 
such recovery can be improved through incorporation of prior information (e.g., on 
the distribution of alpha (i.e., transparency) and color value). In one implementation, 
a Bayesian approach may be applied to regularize the alpha and color data. 
Alternative approaches are also available, including filtering the extracted alpha and 
color data in a depth-dependent fashion.

[0054] As a result, given the foreground exemplar patch $\Pi_p^f$, the transparency
$\Gamma_p$, and the background $\Pi_p^b$, the target patch may be computed according to Equation
(1).

[0055] The exemplary hardware and operating environment of FIG. 11 for
implementing the invention includes a general purpose computing device in the form
of a computer 20, including a processing unit 21, a system memory 22, and a system
bus 23 that operatively couples various system components including the system
memory to the processing unit 21. There may be only one or there may be more than 
one processing unit 21, such that the processor of computer 20 comprises a single 
central-processing unit (CPU), or a plurality of processing units, commonly referred to 
as a parallel processing environment. The computer 20 may be a conventional 
computer, a distributed computer, or any other type of computer; the invention is not 
so limited. 

[0056] The system bus 23 may be any of several types of bus structures
including a memory bus or memory controller, a peripheral bus, a switched fabric, 
point-to-point connections, and a local bus using any of a variety of bus architectures. 
The system memory may also be referred to as simply the memory, and includes read 
only memory (ROM) 24 and random access memory (RAM) 25. A basic input/output 
system (BIOS) 26, containing the basic routines that help to transfer information 
between elements within the computer 20, such as during start-up, is stored in ROM 
24. The computer 20 further includes a hard disk drive 27 for reading from and 
writing to a hard disk, not shown, a magnetic disk drive 28 for reading from or writing 
to a removable magnetic disk 29, and an optical disk drive 30 for reading from or 
writing to a removable optical disk 31 such as a CD ROM or other optical media. 

[0057] The hard disk drive 27, magnetic disk drive 28, and optical disk drive 
30 are connected to the system bus 23 by a hard disk drive interface 32, a magnetic 
disk drive interface 33, and an optical disk drive interface 34, respectively. The drives 
and their associated computer-readable media provide nonvolatile storage of 
computer-readable instructions, data structures, program modules and other data for 
the computer 20. It should be appreciated by those skilled in the art that any type of 
computer-readable media which can store data that is accessible by a computer, such 
as magnetic cassettes, flash memory cards, digital video disks, random access 
memories (RAMs), read only memories (ROMs), and the like, may be used in the 
exemplary operating environment. 

[0058] A number of program modules may be stored on the hard disk, 
magnetic disk 29, optical disk 31, ROM 24, or RAM 25, including an operating 
system 35, one or more application programs 36, other program modules 37, and
program data 38. A user may enter commands and information into the personal 
computer 20 through input devices such as a keyboard 40 and pointing device 42. 
Other input devices (not shown) may include a microphone, joystick, game pad, 
satellite dish, scanner, or the like. These and other input devices are often connected 
to the processing unit 21 through a serial port interface 46 that is coupled to the 
system bus, but may be connected by other interfaces, such as a parallel port, game 
port, or a universal serial bus (USB). A monitor 47 or other type of display device is 
also connected to the system bus 23 via an interface, such as a video adapter 48. In 
addition to the monitor, computers typically include other peripheral output devices 
(not shown), such as speakers and printers. 

[0059] The computer 20 may operate in a networked environment using logical 
connections to one or more remote computers, such as remote computer 49. These 
logical connections are achieved by a communication device coupled to or a part of 
the computer 20; the invention is not limited to a particular type of communications 
device. The remote computer 49 may be another computer, a server, a router, a 
network PC, a client, a peer device or other common network node, and typically 
includes many or all of the elements described above relative to the computer 20, 
although only a memory storage device 50 has been illustrated in FIG. 11. The logical 
connections depicted in FIG. 11 include a local-area network (LAN) 51 and a
wide-area network (WAN) 52. Such networking environments are commonplace in office
networks, enterprise-wide computer networks, intranets and the Internet, which are all
types of networks.



[0060] When used in a LAN-networking environment, the computer 20 is 
connected to the local network 51 through a network interface or adapter 53, which is 
one type of communications device. When used in a WAN-networking environment, 
the computer 20 typically includes a modem 54, a network adapter, a type of 
communications device, or any other type of communications device for establishing 
communications over the wide area network 52. The modem 54, which may be 
internal or external, is connected to the system bus 23 via the serial port interface 46. 
In a networked environment, program modules depicted relative to the personal 
computer 20, or portions thereof, may be stored in the remote memory storage device. 
It is appreciated that the network connections shown are exemplary and other means 
of and communications devices for establishing a communications link between the 
computers may be used. 

[0061] In an exemplary implementation, a virtual image synthesis module, an 
artifact removal module, an image distancing module, a disparity/occlusion map 
generator, and other modules may be incorporated as part of the operating system 35, 
application programs 36, or other program modules 37. Virtual image data, image 
data, image color distances, map data, and other data may be stored as program 
data 38. 

[0062] The embodiments of the invention described herein are implemented as 
logical steps in one or more computer systems. The logical operations of the present 
invention are implemented (1) as a sequence of processor-implemented steps 
executing in one or more computer systems and (2) as interconnected machine 
modules within one or more computer systems. The implementation is a matter of 
choice, dependent on the performance requirements of the computer system
implementing the invention. Accordingly, the logical operations making up the 
embodiments of the invention described herein are referred to variously as operations, 
steps, objects, or modules. 

[0063] The above specification, examples and data provide a complete 
description of the structure and use of exemplary embodiments of the invention. 
Since many embodiments of the invention can be made without departing from the 
spirit and scope of the invention, the invention resides in the claims hereinafter 
appended. 